Catalogo dei prodotti della ricerca

elevant existing features and rules (proposed by Academia and Media) for anomalous Twitter accounts detection. Second, we create a baseline dataset of verified human and fake follower accounts. Such baseline dataset is publicly available to the scientific community. Then, we exploit the baseline dataset to train a set of machine-learning classifiers built over the reviewed rules and features. Our results show that most of the rules proposed by Media provide unsatisfactory performance in revealing fake followers, while features proposed in the past by Academia for spam detection provide good results. Building on the most promising features, we revise the classifiers both in terms of reduction of overfitting and cost for gathering the data needed to compute the features. The final result is a novel Class A classifier, general enough to thwart overfitting, lightweight thanks to the usage of the less costly features, and still able to correctly classify more than 95% of the accounts of the original training set. We ultimately perform an information fusion-based sensitivity analysis, to assess the global sensitivity of each of the features employed by the classifier. The findings reported in this paper, other than being supported by a thorough experimental methodology and interesting on their own, also pave the way for further investigation on the novel issue of fake Twitter followers.

Fame for sale: efficient detection of fake Twitter followers / Cresci, Stefano; DI PIETRO, Roberto; Petrocchi, Marinella; Spognardi, Angelo; Tesconi, Maurizio. - In: DECISION SUPPORT SYSTEMS. - ISSN 0167-9236. - 80:(2015), pp. 56-71. [10.1016/j.dss.2015.09.003]

Fame for sale: efficient detection of fake Twitter followers

Cresci, Stefano;DI PIETRO, ROBERTO;Petrocchi, Marinella;SPOGNARDI, Angelo;Tesconi, Maurizio

2015

Abstract

elevant existing features and rules (proposed by Academia and Media) for anomalous Twitter accounts detection. Second, we create a baseline dataset of verified human and fake follower accounts. Such baseline dataset is publicly available to the scientific community. Then, we exploit the baseline dataset to train a set of machine-learning classifiers built over the reviewed rules and features. Our results show that most of the rules proposed by Media provide unsatisfactory performance in revealing fake followers, while features proposed in the past by Academia for spam detection provide good results. Building on the most promising features, we revise the classifiers both in terms of reduction of overfitting and cost for gathering the data needed to compute the features. The final result is a novel Class A classifier, general enough to thwart overfitting, lightweight thanks to the usage of the less costly features, and still able to correctly classify more than 95% of the accounts of the original training set. We ultimately perform an information fusion-based sensitivity analysis, to assess the global sensitivity of each of the features employed by the classifier. The findings reported in this paper, other than being supported by a thorough experimental methodology and interesting on their own, also pave the way for further investigation on the novel issue of fake Twitter followers.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2015
			
	Parole chiave
	
				Anomalous account detection; Baseline dataset; Fake followers; Machine learning; Twitter; Management Information Systems; Information Systems; Developmental and Educational Psychology; Arts and Humanities (miscellaneous); Information Systems and Management
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Fame for sale: efficient detection of fake Twitter followers / Cresci, Stefano; DI PIETRO, Roberto; Petrocchi, Marinella; Spognardi, Angelo; Tesconi, Maurizio. - In: DECISION SUPPORT SYSTEMS. - ISSN 0167-9236. - 80:(2015), pp. 56-71. [10.1016/j.dss.2015.09.003]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Cresci_Fame_2015.pdf solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.29 MB Formato Adobe PDF Contatta l'autore	1.29 MB	Adobe PDF	Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/960191

Citazioni

ND

300

199

social impact